Probing Cosmology with Weak Lensing Peak Counts 
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We propose counting peaks in weak lensing (WL) maps, as a function of their height, to probe 
models of dark energy and to constrain cosmological parameters. Because peaks can be identified 
in two-dimensional WL maps directly, they can provide constraints that are free from potential 
selection effects and biases involved in identifying and determining the masses of galaxy clusters. 
We have run cosmological N-body simulations to produce WL convergence maps in three models 
with different constant values of the dark energy equation of state parameter, w = —0.8,-1, and 
— 1.2, with a fixed normalization of the primordial power spectrum (corresponding to present-day 
normalizations of as = 0.742,0.798, and 0.839, respectively). By comparing the number of WL 
peaks in 8 convergence bins in the range of —0.1 < k < 0.4, in multiple realizations of a single 
simulated 3x3 degree field, we show that the first (last) pair of models can be distinguished at 
the 95% (85%) confidence level. A survey with depth and area (20,000 degree 2 ), comparable to 
those expected from LSST, should have a factor of « 50 better parameter sensitivity. We find that 
relatively low-amplitude peaks (k ~ 0.03), which typically do not correspond to a single collapsed 
halo along the line of sight, account for most of this sensitivity. We study a range of smoothing 
scales and source galaxy redshifts (z s ). With a fixed source galaxy density of 15 arcmin -2 , the best 
results are provided by the smallest scale we can reliably simulate, 1 arcminute, and z 3 — 2 provides 
substantially better sensitivity than z s < 1.5. 

PACS numbers: 98.80.Cq, ll.25.-w, 04.65.+e 



I. INTRODUCTION 

Weak gravitational lensing (WL) has emerged as one 
of the most promising methods to constrain the param- 
eters of both dark energy (DE) and dark matter (DM) 
[jQ. This method exploits the fact that images of dis- 
tant galaxies are distorted as their light is lensed by in- 
homogeneities in the foreground gravitational potential. 
DE and DM alter the expansion history of the universe, 
as well as the rate at which cosmic dark matter struc- 
tures grow from small initial perturbations, and thereby 
modify the statistical features of the WL signal [5] . Sev- 
eral deep astronomical surveys plan to cover large regions 
( >10 3 degree 2 ) of the sky in the future, detecting mil- 
lions of background galaxies, and measuring their shapes 
precisely, in order to reveal the statistical features of the 
WL distortion. Examples include the Panoramic Survey 
Telescope And Rapid Response System (Pan-STARRS) 
3 , the Kilo-Degree Survey (KIDS) 1 , the Dark Energy 
Survey (DES) 2 , and the survey by the Large Synoptic 
Survey Telescope (LSST) [4] [5] covering half of the sky. 

Existing theoretical studies assessing the utility of such 
surveys to probe the properties of DE and of DM can be 
roughly divided into two types. The first type analyzes 
the WL distortion primarily on large scales, caused by 
fluctuations in the foreground mass distribution on corre- 



1 http://www.strw.lcidenuniv.nl/ kuijkcn/KIDS 

2 https:/ /www. darkenergysurvey.org 



spondingly large scales. On these scales, the fluctuations, 
and hence the WL distortion patterns, are linear or quasi- 
linear, and can be readily and reliably interpreted using 
existing semi-analytic tools [HI El El El LTHl [III LTU H31 HI] • 
These studies (for example, those computing the two- 
dimensional power spectrum of the shear) have, in fact, 
been pushed into the non-linear regime, but the depen- 
dence of the non-linear power-spectrum on cosmological 
parameters has not been verified in simulations, to an 
accuracy necessary to take full advantage of the future 
surveys. 

The second type analyzes the distortion on smaller 
scales, where it is dominated by the nonlinear mass fluc- 
tuations. In the limit of extremely non-linear, high-a 
WL shear peaks, the individual peaks tend to be caused 
by a single, discrete, massive collapsed object - a galaxy 
cluster - in the foreground [T5J [THl HI]. The abun- 
dance evolution and spatial distribution of galaxy clus- 
ters in different cosmologies can then be studied semi- 
analytically (calibrated by simulations), and has been 
shown to place very strong constraints on DE parame- 
ters, because it is exponentially sensitive to the growth 
rate of dark matter fluctuations. Several studies have ex- 
plored constraints expected from samples of tens of thou- 
sands of galaxy clusters in Sunyaev-Zeldovich and X-ray 
surveys (e.g. jTSJ [HO [20]), and from the still larger sam- 
ples of clusters expected to be detectable in large weak 
lensing surveys [2T ] 122 ] 123] . 

The amount of statistical information in these two 
regimes appears to be comparable |24j : furthermore, the 
information from the two approaches is complementary, 
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and using them in combination significantly improves 
DE constraints compared to using either method alone 

[SUES]. 

The chief difficulty with using galaxy clusters identi- 
fied in WL maps, however, is that the correspondence 
between peaks in WL maps and collapsed clusters is 
not one-to-one. A significant fraction of WL peaks 
are caused by chance alignments of multiple non linear 
structures along the line-of-sight. Conversely, a non 
negligible fraction of real clusters are missed because they 
do not show a significant WL peak, due to a chance can- 
cellation of their shear signal by under-dense regions in 
the foreground. In principle, the statistical correspon- 
dence between WL peaks and clusters can be precisely 
quantified ab-initio in numerical simulations, but in prac- 
tice, this remains a challenge to the required level of ac- 
curacy. In particular, the completeness and purity of 
cluster selection are of the order of 60 — 80% for peaks 
selected to lie above a ~ 5<r convergence threshold, and 
both become progressively worse for lower S/N peaks 
P21 HU [Ml I2H I2H] • Systematic selection errors are there- 
fore likely to dominate over the statistical constraints for 
the majority of a WL-selected cluster sample. 

A natural way to circumvent such selection effects is 
to study the statistics of non-linear peaks in WL maps 
directly. This method, however, does not lend itself to 
straightforward mathematical analysis - it requires nu- 
merical simulations and has been relatively much less 
well-explored (e.g. (29J |30]). In this paper, we study 
WL peaks in simulated maps, and their usefulness to con- 
strain DE properties. More specifically, we use a set of 
N-body simulations, combined with ray-tracing, to cre- 
ate convergence maps for three different values of the 
DE equation of state, w — p/p. We then identify peaks 
in such maps, and tabulate them as a function of their 
height. We explicitly avoid a reference to clusters, cluster 
masses, or mass functions - we utilize only the directly 
observable height of the maximum of each convergence 
peak. This method should therefore provide one of the 
most robust cosmological probes, free from potential se- 
lection effects and mass measurements biases affecting 
cluster identifications. 

For increasingly negative values of w, DE starts to 
dominate at increasingly later cosmic epochs, and one 
expects an excess of high peaks in such models, com- 
pared to those with less negative w. In this first paper, 
we perform a feasibility study, to asses whether changes 
in w indeed have a measurable effect on the number of 
peaks. To this end, all cosmological parameters, other 
than w, are held fixed at their fiducial values. 3 In a 



3 It is important to note here, however, that we fix the primor- 
dial amplitude of the density perturbations; this results in a in- 
dependent value of the present— day normalization erg. In the 
rest of this paper, whenever we compare cosmologies with two 
different values of to, it should be remembered, even when not 
explicitly stated, that these two cosmologies also have different 



follow-up paper, we will vary other cosmological param- 
eters, thereby exploring degeneracies between parameter 
combinations and obtaining more realistic, marginalized 
constraints, on each parameter. 

While the cosmological dependence of the cluster 
counts has been studied extensively, it is not a priori clear 
whether the full catalog of all WL peaks contains a com- 
parable amount of information. A large number of peaks 
caused by large-scale structure, with no w-dependence, 
would constitute pure noise, and could wash out the the- 
oretically known difference from peaks due to clusters. A 
strong w-dependence in the number of non-cluster peaks, 
in the sense opposite to that of the cluster peaks, could be 
even more detrimental, since this would explicitly cancel 
some of the signal contained in the cluster counts. Con- 
versely, a strong ^-dependence in the number of non- 
cluster peaks, in the same sense as the cluster counts, 
could significantly enhance the sensitivity. This remains 
a realistic possibility, since the dependence of the number 
of peaks caused by line of sight projections on cosmology 
has not yet been studied. 

An intuitive expectation is that large-scale structure 
filaments are expected to contribute more strongly to rel- 
atively low height peaks, and will be more prominent for 
source galaxies at higher redshift. This is because most 
filaments are likely to line up only partially, thus creat- 
ing a weaker signal than those caused by massive col- 
lapsed clusters, and because the probability to find large 
(~ lOOMpc) filaments that are lined up along the line of 
sight will increase significantly at larger distances. We 
perform our simulations at high mass-resolution (down 
to less than 5 x 10 9 M Q per particle), allowing us to re- 
solve relatively low significance peaks in our convergence 
maps, where such projections of large scale structure are 
more common,. We also place the source galaxies out to 
redshifts as high as z s = 2. 

To keep the computational cost of the simulations rea- 
sonable, we sacrifice the angular size of our simulated 
field of view, limiting ourselves to creating WL maps cov- 
ering 2.86 x 2.86 degrees (for simplicity, we will hereafter 
refer to these as 3 x 3 degree fields). 

In these respects, our work differs from the recent re- 
lated study by [22j , which considered the projected mass 
function out to a lower redshift (to z — 0.3 — 0.6), with 
simulations at significantly lower resolution, to demon- 
strate that the 2D projected mass function obeys a scal- 
ing with f2 m and erg that is similar to that of the three- 
dimensional halo mass function ([25] also do not vary 
to). Our work shows that the depth of the survey (at 
least to z ~ 2) is vital for distinguishing between models 
with different to, and that the main distinguishing power 
comes from medium height convergence peaks, with con- 
vergence k ~ 0.03 (on maps with 1 arcmin smoothing). 

Another recent study, [3U] explored probing cosmology 



values of rrg. 
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with a related, simple statistic: the high tail of the one- 
point probability distribution of the convergence. The 
total area above a fixed convergence threshold includes a 
sum of the area of all peaks lying above this threshold, 
and we therefore expect it to capture less information 
than peak counts. Nevertheless, [30 have found that this 
statistic can already place constraints on DE properties, 
comparable to the traditional linear two point correlation 
function of shear or to the limited use of the nonlinear in- 
formation from counting galaxy clusters [50] . Our paper 
improves on this preliminary study, by employing direct 
simulations, rather than fitting functions, for the con- 
vergence and its dependence on cosmology, and also by 
examining a potentially more powerful statistic. 

As our paper was being prepared for submission, we 
became aware of an independent preprint, addressing the 
dependence of WL peaks on the background cosmology 
3 1 J . Their motivation is essentially identical to our own, 
and the conclusions reached are quite similar. However, 
the two studies are complementary, in that |31j study 
the peak dependence in the (fl m ,as) plane assuming a 
ACDM cosmology, while we vary the dark energy equa- 
tion of state parameter w (with corresponding changes in 
erg). There are additional differences, such as the specifi- 
cations in the simulations that were used, and the statis- 
tical analysis methods, which we briefly discuss further 
in § |TVFl below. 

The rest of this paper is organized as follows. In § II, we 
describe our calculational procedure, including the set- 
up and analysis of the simulations. In § III, we test our 
computations by comparing our numerical convergence 
power spectrum to semi-analytic predictions. In § IV, we 
present and discuss our main results on the peak counts. 
Finally, in § V, we summarize our main conclusions and 
the implications of this work. 



II. METHODOLOGY 

This section describes the procedure to create the sim- 
ulated weak lensing maps, from generating the matter 
power spectrum with CAMB and scaling it to the start- 
ing redshift of the N-body simulation, to generating the 
initial conditions for the simulation, ray-tracing the weak 
lensing maps from the simulation output, as well as post- 
processing and analyzing these maps. 

A. N-body Simulations 

For this study, we generated a series of cold dark mat- 
ter (CDM) N-body simulations with the code GADGET- 
2 [32 , which include DM only; baryons and neutri- 
nos are both assumed to be absent. We have modi- 
fied GADGET-2 for use with arbitrary equation of state 
parameter w, but in this work, we restrict ourselves to 
constant w. As the reference model, we chose a ACDM 
model with parameters close to the best-fit values from 



WMAP5, i.e. with cosmological constant J7a = 0.74, 
matter density parameter £l m = 0.26, Hubble constant 
h = 0.01 x (Ho/km s^Mpc" 1 ) = 0.72. We fix the am- 
plitude of curvature fluctuations = 2.41 x 10~ 9 at 
the pivot scale k = 0.002Mpc~ 1 . This results in differ- 
ent values of the present-day normalization, as, for the 
different dark energy models. In the three models we 
consider with w = —0.8,-1, and —1.2, the values are 
a 8 = 0.742, 0.798, and 0.839, respectively. 

For each of the three cosmologies, with all parameters 
other than w and as fixed, we ran two independent N- 
body simulations, with two different random realizations 
of the initial particle positions and velocities. The simu- 
lations used 512 3 particles, in a box with a comoving size 
of 200/i~ 1 Mpc, starting at redshift z = 60. This corre- 
sponds to a mass resolution of m — 4.3 x 10 9 Af Q per dark 
matter particle. The Plummer-equivalent gravitational 
softening of the simulations was set to epi — 7.5/i -1 kpc 
(comoving). Fixing the softening length in physical 
(rather than comoving) units is not warranted, since for 
a pure CDM simulation without gas, no further physical 
scale enters. 

The initial conditions (ICs) for the simulations were 
generated with an IC generator kindly provided by Volker 
Springel. The matter power spectrum at z = 60, 
which is an input for the IC generator, was created with 
the Einstein-Boltzmann code CAMB [33], a modifica- 
tion/extension of CMBFAST [51]. GADGET-2 does not 
follow the evolution of the radiation density, which, start- 
ing with the correct power spectrum at z = 60, would ac- 
cumulate to a ~ 2% error in its normalization by z = 0. 
To eliminate this small bias, we used CAMB to produce 
the power spectrum at z = and scaled it back to z = 60, 
with our own growth factor code, with the radiation com- 
ponent absent. This leads to a small deviation from the 
true power spectrum at the starting redshift, but the 
match will become better at the lower redshifts which 
are more important for our analysis. 

The output of the simulations consists of particle po- 
sitions at various redshifts between z = and z = 2. 
Similar to choices in most previous works (e.g. [37]), the 
output redshifts were chosen to span comoving intervals 
of 70/i _1 Mpc along the line of sight in the ACDM cos- 
mology. For simplicity, the same redshifts were used in 
the other cosmologies. This results in a small difference 
in comoving separation between the planes in these cos- 
mologies, but this has negligible impact on our results. 

Most simulations and all post-processing analysis were 
run on the LSST Cluster at the Brookhaven National 
Laboratory, an 80-CPU Linux cluster with 80 GB of 
memory. Some simulations were run on the Intel 64 
Cluster Abe, with 9600 CPUs and 14.4 TB of memory 
at the National Center for Supercomputing Applications 
(NCSA), a part of the NSF TeraGrid facility. 
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B. Weak Lensing Maps 

We use our own codes to produce and analyze weak 
lensing maps from the N-body simulation output. This 
subsection describes the procedures and algorithms, as 
well as the testing of the new codes. 



1. Potential Planes 

As mentioned above, the simulation outputs are pro- 
duced at every 70/i -1 Mpc, which results in an as 2/3 
overlap between consecutive 200/i~ 1 Mpc cubes. We pre- 
pare the N-body simulation output for ray-tracing by 
truncating the snapshot cubes along the line of sight in 
order to remove this overlap, and then project the parti- 
cle density of each cube on a 2D plane p perpendicular to 
the line of sight at the center of the plane, located at the 
same redshift as the original cube. We use the triangular 
shaped cloud (TSC) scheme [35 to convert the individual 
particle positions in the simulations to a smoothed den- 
sity field on the 2D plane. The projected density contrast 
field on plane p, 5 p 2D \x) — J^ p ^ dy[p(x,y)/p — 1] is cal- 
culated on a 2D lattice of 2048 x 2048 points (here x 
represents the coordinates in the plane perpendicular to 
the line of sight; y is the radial coordinate, and y p is the 
position half-way between the p th and (p+ 1) planes). 
We then use the 2D Poisson equation 



3fL 



i ^e D \x) 



(1) 



to calculate the 2D gravitational potential ^ p on plane 
p. This equation is solved easily in Fourier space, in 
particular since the density contrast planes have periodic 
boundary conditions. The resulting potential plane is 
then stored in FITS format [36 for further use by our ray- 
tracing code. This multi-step approach has proven useful 
to save memory, and for re-using intermediate steps of 
the lensing map construction. 
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where I is the identity matrix, f(x) and a(x) denote the 
comoving angular diameter distance and the scale factor, 
evaluated at the comoving coordinate distance x, an( l the 
so-called optical tidal matrix, 
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contains the second spatial derivatives of the gravita- 
tional potential if? on plane p. The derivatives are to 
be evaluated at the point on plane p intersected by the 
light ray. Starting with A = I, equation ^ can be 
solved to find A„ for any n. 

The physical weak lensing quantities, the convergence 
k and the two components of the shear, 71 and 72, are 
extracted from the distortion tensor via 



A = 



1 - AC - 71 -72 - UJ 
-72 + LO 1 — K + 71 



(5) 



The observable quantities, the magnification \i and the 
distortion 5, are given by 
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where g — 7/(1 — k) is the reduced shear. In the weak 
lensing limit these relations simplify to fi = 1 + 2k, and 
£=27. 

For simplicity, for the purpose of this paper, we will 
work directly with the maps of the theoretically predicted 
convergence k. In practice, in the absence of external size 
or magnification information, neither the shear, nor the 
convergence is directly observable, but only their com- 
bination, the reduced shear g. One can, in principle, 
convert one quantity to the other; in practice, on small 
scales, where lensing surveys get much of their constrain- 
ing power, the errors introduced by this conversion will 
not be negligible, and must be taken into account when 
extracting cosmological parameters from actual data |38j . 



2. Ray- Tracing 

To create weak lensing maps from the N-body sim- 
ulations, we implement the two-dimensional ray-tracing 
algorithm described in [37 . We follow 2048 x 2048 light 
rays through a stack of potential planes, which, as men- 
tioned above, are spaced 70/i~ 1 Mpc apart in the case of 
the ACDM cosmology. Starting at the observer, we fol- 
low the light rays backward to the source galaxies. At 
each lens plane n, we calculate the distortion tensor and 
the gravitational lensing deflection from the equations: 



n-l 

E 
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C. Source Galaxies, Smoothing and Noise 

The point -particles in the N-body simulations repre- 
sent lumps of matter with a finite spatial distribution. 
While gravitational softening is included in GADGET-2 
to prevent unrealistically close encounters, the simula- 
tions still follow the assembly of point particles. As a 
result, several additional types of smoothing are neces- 
sary to produce weak lensing maps. The TSC scheme, 
which is used to place particles on the density contrast 
planes, generates smooth two-dimensional density con- 
trast distributions, thus avoiding unnaturally large de- 
flections when a light ray would otherwise pass close to an 
individual particle. In the ray-tracing step, as light rays 
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pass through arbitrary points on the potential planes, we 
employ the same TSC kernel to compute the potential at 
the ray intersection points. 

Since the orientation of the disks of the galaxies with 
respect to the observer is unknown, this intrinsic ellip- 
ticity appears as noise in the maps. Following the inter- 
pretation of [39] in [24], we take the redshift-dependent 
r.m.s. of the noise in one component of the shear to be 



aJz) = 0.15 + 0.035z 



(7) 



and perform the calculation analogously to |39j but with 
a square top-hat function representing our pixels. The 
resulting convergence noise correlation function for our 
pixclized map where x and y are the discrete 2D pixel 
coordinates is: 



(^noise (•£) ^noisc 



n„aiA 



(8) 



where 5gg is the Kronecker symbol, n ga [ is the surface 
density of source galaxies, and A is the solid angle of a 
pixel. For simplicity, in our analysis we assume that the 
source galaxies are located on one of three source planes, 
at fixed redshifts (z s = 1, 1.5, 2), with n ga i = 15 arcmin -2 
on each source plane. In combination, this would yield 
a total surface density of n ga i = 45 arcmin -2 . For 
comparison, we note that this is larger than the value 
« 30 arcmin -2 (with a redshift-distribution that peaks 
at z s ~ 1), expected in LSST. However, we do not have 
sufficient simulation data to perform accurate tomogra- 
phy, and therefore, our results below utilize only a sin- 
gle source plane, with a conservative number of n ga i = 
15 arcmin" 2 , rather than the full n ga i = 45 arcmin - (see 
below). Thus, we add noise to the convergence in each 
pixel, drawn from a Gaussian distribution with the vari- 
ance given in equation (|8l. Note that the noise between 
different pixels is uncorrelated. 

In order to investigate the information content in peaks 
defined on different scales, once noise in each pixel is 
added, we apply an additional level of smoothing, with a 
finite 2D Gaussian kernel, on scales between 6q — 0.5—10 



arcminutes (see Sec. II D for exact values). The Gaussian 



in practice, we can equivalently make two passes through 
the map with ID Gaussian kernels, oriented once in each 
of the two orthogonal dimensions of the map. This saves 
computation time, especially for large smoothing scales. 

The noise in the convergence o\ oise introduced by in- 
trinsic ellipticity, after the Gaussian smoothing, becomes 



(9) 



noise o n2 

2n0£,n ga i 



which is the equivalent of ([8]), but with a Gaussian 
smoothing instead of a top-hat function [39 , and depends 
on the redshift distribution of the source galaxies, as well 
as on the smoothing scale 0q applied to the maps. In gen- 
eral, the noise level should be computed as the weighted 
average of the redshift-dependent shear noise over 
the redshift distribution of the source galaxies: 



n gal JO az 



(10) 



kernel has the nice property of being decomposable, thus 



Since in our case, for simplicity, we assume that the 
galaxies are located on three discrete source planes, 
dn/dz is simply a delta function, at z s = 1,1.5, or at 
z s = 2 (or a sum of delta functions, for the purpose of 
redshift tomography). 



D. Statistical Comparisons of Convergence Maps 

The main goal of this paper is to discern between differ- 
ent cosmologies by counting the convergence peaks. Since 
the maps have been smoothed, as mentioned above, it is 
sufficient to identify peaks in the maps as pixels which 
correspond to local maxima, in the sense that all eight 
of their neighboring pixels have a lower value. For illus- 
tration, we show an example of the convergence maps 
generated in two of our cosmologies, before and after the 
addition of noise and smoothing, in Figure [T] The panels 
with w = —1 exhibit visibly more structure at high con- 
vergence, consistent with the expectation that DE starts 
to dominate later in this model, allowing more time for 
growth. 



Once the peaks are identified, their heights are 
recorded in a histogram. This histogram is divided into 
a reasonable number of bins (the choice in the number 
of bins will be discussed below) , and a vector of values is 
formed from these bins. We will also attempt to utilize 
the source galaxy redshifts for tomography, and to com- 
bine different smoothing scales, so that each peak is fur- 
ther associated with a value of z s and 9q. Finally, peak 
identification is repeated in each realization of a given 



cosmology. We therefore denote the number of peaks in 
bin i with N^Jg , where / and r indicate cosmology and 
realization, respectively, z s indicates the redshift of the 
sources and 9q is the smoothing scale applied to the final 
map. To simplify notation we hereafter drop the last two 
lower indices and merge them into the single index i. 

In order to assess the statistical significance of any dif- 
ference in peak counts, we construct the covariance ma- 
trix C^> for cosmology / of the counts in the different 



w=-1 raw original 



w=-0.8 raw original 
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FIG. 1: Examples of 3 x 3 degree convergence maps from our 512 3 particle N-body simulations in two different cosmologies. 
The left column shows maps for w — —1, the right column for w = —0.8. The top row shows the raw maps, with no smoothing 
or noise; the bottom row shows the maps after the addition of random ellipticity noise and 1 arcmmute smoothing. The panels 
with w = — 1 exhibit visibly more structure at high convergence (yellow and red in the bottom panels); voids (regions of low 
convergence, shown in blue) also appear more numerous in this cosmology. 



height-, redshift-, and smoothing-scale bins i and j, 



C 



(/) 



1 



R 



R 



l <H 

r=l 



(f;r) 



(11) 



R- 1 E"=i ^ /;r) is the average number of 



where 

peaks in bin i in the fiducial model /, and the summation 



in equation (111 is over the total number of realizations 



R constructed for every cosmology. 



We use 200 such realizations. However, as noted above, 
we have run only two strictly independent realizations of 
each cosmology, i.e., two different simulations with dif- 
ferent realizations of the initial conditions. Thus we pro- 
duce additional pseudo-independent realizations by the 
standard procedure of randomly rotating, shifting, and 
slicing the data in the simulation output cubes at each 
redshift, prior to generating the 2D density planes. Our 
box-size, 200/i~ 1 Mpc is much larger than the few co- 
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moving Mpc correlation length of large-scale structures. 
As mentioned above, the data cubes are truncated at 
each output, so that we utilize only a radial slice cov- 
ering about l/3rd of its length; the 2.86 x 2.86 degree 
field furthermore utilizes only a fraction of this slice. As 
a result, at intermediate redshifts, where the lensing ker- 
nel peaks, each realization utilizes about a twelfth of the 
volume of the snapshot cube, allowing us to access, in 
each N-body run, effectively ~ 12 truly independent re- 
alizations of each lens plane. At the very highest red- 
shift, the situation becomes worse, with up to a third 
of the volume of the data cubes used. However, due to 
the distance-dependence of the lensing kernel, the planes 
near z s = 2, with these heavily sampled boxes, do not 
contribute much to the final lensing signal. The random 
rotations and shifts will create many more than 12 essen- 
tially independent random realizations of the incidental 
line ups of multiple small clusters and filaments. On the 
other hand, the very largest clusters - which may create 
a substantial lensing peak by themselves - will tend to 
reappear in multiple realizations, and could lead to an 
underestimation of the variance. Fortunately, our main 
results come from medium-height peaks, which typically 
are not caused by a single cluster, so that any such un- 
derestimation at the high-convergence end contributes 
relatively little to our final distinguishing power. 

We use the (inverse of) the covariance matrix to com- 
pute a value of X/'( r ) f° r each realization r of a "test" 
cosmology, labeled by /', against the expectations in the 
fiducial cosmology, labeled by /, 



,(r) = dN (/ ' ;r) (C (/) )- 1 dN (/ ' ;r) (12) 
= ^^(C^dN^, (13) 



where dN^ = JV i u '' ' — ' is the difference between 
the number of peaks in bin i in realization r of the test 
cosmology, and the average number of peaks in the same 
bin in the fiducial cosmology. 

The resulting probability distributions for Xf,(r), us- 
ing 200 realizations of each cosmology, fixed source red- 
shift z s = 2 and smoothing scale Qq — 1', and 8 peak 
height bins, is shown in Figure [2] In the top left panel, 
the realizations were drawn from the simulation of the 
fiducial w = — 1 model itself, which was used to com- 

— (/) 

pute the mean number of peaks N i . Our simulated 
X 2 distribution is very close to a genuine x 2 distribu- 



r(/';r) 



tion with n — 8 degrees of freedom (shown by the black 
curve). In the bottom left panel, the realizations were 
again drawn from the w = — 1 cosmological model, but 
from the simulation with an independent realization of 
the initial conditions. The good agreement with the top 
left panel tests the accuracy of our covariance matrix (in 
particular, if the covariance had been underestimated, 
then this second realization would have produced larger 



X values; see Appendix VI). The two right panels show 



X distributions computed in the realizations of differ- 
ent cosmologies, with w = —0.8 (top right panel), and 
w = —1.2 (bottom right panel). The w = —0.8 and 
ACDM cosmologies are distinguished at the 68% (95%) 
confidence level in 84% (39%) of the realizations; the 
w = —1.2 cosmology is distinguishable from ACDM at 
the same confidence in 61% (24%) of the realizations. 

For reference, the above confidence levels can be com- 
pared to a simpler estimate of the statistical distinction 
between the two pairs cosmologies, based on the value 
of Axp f > computed using the mean number of peaks in 
each bin (averaged over all realizations), 
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dN (/'J)((7(/))-i dN (/',/) ( 14 ) 
Y^dN^'^iC^dN^^, (15) 



where dN^' J) = N^^ 



f (f) 



N} J ' is the difference between 
the average number of peaks in bin i in cosmology /' 
and in cosmology /. We find Ax 2 = 8.35 between 
w = —0.8 and w = —1 (when then same realization of 
the initial conditions are used in both cosmologies) and 
Ax 2 = 9.54 (when different realizations are used). The 
covariance matrix is calculated from the w — —1 cos- 
mology in both cases. Assuming that the likelihood dis- 
tributions are Gaussians, these values would correspond 
to a 2.8cr and 3er difference between the two cosmolo- 
gies. This is in good agreement with the 95% exclu- 
sion of the mean of the x 2 distribution we find from the 
individual realizations in the w = —0.8 model - indi- 
cating that interpreting the A% 2 directly as a probabil- 
ity would only slightly overestimate the statistical dif- 
ference between the two cosmologies. The correspond- 
ing value between the w = —1.2 and w = — 1 models 
is A^2 = 4.65 (with the same realization of the initial 
conditions), again confirming that the true likelihood is 
only somewhat weaker than this simple estimate for the 
exclusion of the w = —1.2 model. 



In the above analysis, we have chosen conver- 
gence bins whose width was varied, so that they 
contain approximately equal number of counts 
(« 350 peaks/bin at 1 arcmin smoothing). More 
specifically, the bin boundaries are located at k = 
-0.084, 0.0022, 0.015, 0.024, 0.032, 0.041, 0.052, 0.068, 0.41. 



While equal- width bins would perhaps be a more natural 
choice, the number of peaks falls off exponentially with 
peak height, and this would have the undesirable result 
of having many almost empty bins, leading to large x 2 
values excessively often (overestimate of improbability 
of unlikely events), and strong deviations from a true 
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FIG. 2: Normalized probability distribution of x , computed by comparing the peak counts in individual 3x3 degree convergence 
fields to the ensemble average in the fiducial w = —1 model. Peaks were divided into 8 bins by height. The source galaxies were 
placed at redshift z s — 2, and a Gaussian smoothing of 6a = 1 arcmin was applied after the addition of ellipticity noise. In the 
top left panel, the realizations were drawn from the w = — 1 simulation, the average of which was used to construct the fiducial 
model. The black curve shows a genuine \ 2 distribution with n — 8 degrees of freedom. In the bottom left panel, the realizations 
were again drawn from a w = — 1 model, but from a simulation with different random initial conditions, illustrating that these 
have a negligible effect on the result for most of the distribution, and testing the accuracy of our covariance matrix. The two 
right panels show \ 2 distributions in different cosmologies, with w — —0.8 (top right panel), and w = —1.2 (bottom right panel). 
The w = —0.8 and ACDM cosmologies are distinguished at the 68% (95%) confidence level in 84% (39%) of the realizations; 
the w — —1.2 cosmology is distinguishable from ACDM at the same confidence in 61% (24%) of the realizations. 



X 2 distribution (since the peak counts do not have 
a Gaussian distribution, especially for bins with low 
average counts). 



III. WEAK LENSING POWER SPECTRUM 

Because our pipeline to produce the WL maps is newly 
assembled, GADGET-2 was modified to allow w ^ — 1, 
and the ray-tracing and WL map analysis codes were 
newly written, we performed a consistency check, by 
comparing the power spectrum of the convergence in 
our maps with the power spectrum derived from semi- 
analytical predictions. This comparison also determines 
the range of scales which is adequately represented by our 
simulations due to resolution and box size. In Figure [3] 
we show the power spectrum of convergence, averaged 
over 200 realizations of our simulations (solid curves), to- 
gether with the theoretical expectations (dashed curves) . 
The latter were obtained by direct line-of-sight integra- 
tion using the Limber approximation |40j . and using fit- 



ting formulas for the nonlinear 3D matter power spec- 
trum from [U]. From top to bottom, the three sets of 
curves correspond to w = —1.2 (green curves), w = — 1 
(red curves), and w = —0.8 (blue curves). 

We find significant deviations from the theoretical 
power spectra on both small and large scales, as expected. 
On small scales, with £ > 20, 000, we are missing power 
because of the finite resolution on the mass planes and on 
the ray-tracing grid. On large scales, with I < 400, the 
spectrum is suppressed because of the finite size of the 
simulation box. These two wave numbers thus set the 
range, from ~ 1 arcmin to ~ 1 degree, on which the ab- 
solute value of the converge power spectrum is accurately 
captured. 

While we can not trust our results beyond these lim- 
its in the subsequent analysis, it should be noted that 
the ratio of the power spectra in different cosmologies 
is preserved accurately, even where the simulated power 
spectra start to deviate from the theoretical predictions. 
Since we are mainly interested in the comparison between 
the cosmologies, this makes our results even more robust. 
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FIG. 3: Convergence power spectrum P{1), as a function 
of £ — 2n/9, derived in the three cosmologies (solid blue: 
w = —0.8, solid red: w — — 1, solid green: w = —1-2) from the 
raw un-smoothed 3x3 degree maps, without ellipticity noise. 
The simulated power spectra have been averaged over 200 real- 
izations in each cosmology. Also shown are the corresponding 
semi-analytical predictions based on the matter power spec- 
trum of \41\ and using the Limber approximation \40\ ( dashed 
curves). The drop around I ~ 20,000, which corresponds to 
~ 1 arcmin, is due to the finite resolution of the simulations. 
The power is also suppressed on large scales, because of finite 
box size. Note that the ratios of power spectra in the three cos- 
mologies are accurate in the simulations over a wider range 
than the absolute power. 



The fundamental quantity we are interested in is the 
number of peaks in different convergence bins. This 
quantity is shown in Figure [4] in the three different cos- 
mologies, derived from convergence maps with a single 
source galaxy plane at z s — 2, after the addition of ellip- 
ticity noise and Gaussian smoothing with Qq = 1 arcmin. 
From top to bottom (at the peak of the distribution), 
the three curves correspond to w = —0.8 (blue curves), 
w = —1 (red curves), and w = —1.2 (green curves). 
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Another consistency check we perform is a comparison 
between the number of peaks above a certain threshold 
to the number of clusters creating lensing peaks above 
that same threshold. This comparison will presented in 
Sec. [TO] below. 



FIG. 4: Number of convergence peaks in intervals [k, k + Ak], 
with An — 0.0161 in 3 x 3 degree fields with source galaxy 
redshift z s = 2, 1 arcmin smoothing, and intrinsic elliptic- 
ity noise added. From top to bottom ( at the peak of the dis- 
tribution), the three curves correspond to w — —0.8 (blue), 
w = — 1 (red), and w — —1.2 (green). The total number of 
peaks is 2477, 2432, and 2401, respectively. Notice that the 
lower distributions have wider wings. 



IV. RESULTS AND DISCUSSION 

We have found several important quantitative and 
qualitative results in our study. We first discuss the dif- 
ference in peak counts between the cosmologies - in the 
total number of peaks, as well as in the number of peaks 
in individual convergence bins. We then present our in- 
ferred sensitivity to distinguish these cosmologies, and 
extrapolate this sensitivity from our 3x3 degree field to 
an LSST-like survey. We discuss the validity of such an 
extrapolation by studying the variance in peak counts as 
a function of angular scale. In the next subsection, we 
comment on correlations between peaks in different bins, 
with source galaxies at different redshifts, and with differ- 
ent angular smoothing scales, as well as cross-correlations 
among these. Finally, we briefly contrast our results to 
other predictions for an LSST-like WL survey, namely 
constraints from the weak lensing shear power spectrum, 
the PDF of k found in [3D], as well as cluster number 
counts. 



As this figure demonstrates, the distribution of peak 
heights becomes narrower with increasing w. 

The total number of convergence peaks in the noisy, 
smoothed maps is N total = 2432 ±30, N total = 2477 ±30, 
and N tota i = 2401 ± 34 for w = -1, w = -0.8, and 
w = —1.2, respectively. Formally, this is a 1.5a dif- 
ference for w — —0.8 and a somewhat weaker differ- 
ence of lcr for w — —1.2. The same numbers for the 
smoothed noiseless maps (still with 1 arcmin smooth- 
ing) are N tot ai = 1646 ± 24, N total = 1654 ± 28, and 
Ntotal = 1644 ± 24, respectively. Interestingly, the addi- 
tion of ellipticity noise boosts both the total number of 
peaks (by ~ 50%; note that a similar increase was ob- 
served by [16]), and the significance in the difference in 
peak counts between pairs of cosmologies. Apparently, 
the number of new peaks introduced by the presence of 
noise is, itself, w-dependent, in a way that helps in dis- 
tinguishing the two cosmologies. 

While this result may first sound surprising, the ad- 
dition of noise is known boost the discriminating power 
of related observables. For example, scatter in the mass- 
observable relation generally increases the number of de- 
tectable clusters and tightens constraints from cluster 
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number counts (e.g |19jV In our case, one can image that 
the number of new peaks on 1 arcmin scales, at a given 
peak height, introduced by the presence of noise, depends 
on the amplitude of the re-fluctuations on larger angular 
scales, which, itself, is cosmology-dependent. The pre- 
cise origin of this result will be investigated in a follow-up 
paper. 

Most of the ~ 2,500 peaks, of course, have a low am- 
plitude. In order to compare our results with previous 
work, it is of interest to consider the number of peaks at 
higher significance. As an example, above the commonly 
used threshold of kq — A.ba no i se (for a source galaxy den- 
sity of ng a i = 15 arcmin -2 at redshift z s = 2, the value 
of this threshold corresponds to kq — 0.102 [eq. [9]), the 
average number of peaks we identify in our 200 pseudo- 
independent realizations is N pca ^ s>KG = 47.4 ± 11.8, 
A pcaks>KG = 36 - 6 ± 10 - 4 ' and 7V P<=aks>K G = 54.5 ± 11.4 
for w = —1, w = —0.8 and w — —1.2 respectively, which 
represent ss la differences between the w — — 1 and the 
tii / 1 models. These numbers are with ellipticity noise 
included. Without ellipticity noise, the number of peaks 
above the same threshold is -/Vp ea ks>KG — 25.9 ± 7.3, 
A pcaks>KG = 14 - 8 ± 5 -2, and N pcaks>KG = 37.0 ± 9.5, 
which, again, represent a ltr differences. Although the 
number of these rare high-er peaks is also boosted by the 
presence of noise, apparently this boost does not enhance 
the distinguishing power between cosmologies. 

It is also useful to compare the number of peaks to the 
estimated number of clusters causing a convergence sig- 
nal above the same threshold k,q . To be closer to predic- 
tions in earlier work, for this comparison, we take n ga i — 
30 arcmin -2 , resulting in a threshold of kq — 0.072 for 
1 arcmin smoothing. In our noiseless maps, we obtain 

AWks>K G = 99 - 7 ± 20 -° and ^Poak^Kc = 64.6 ± 14.3 
for w — —1 and w = —0.8 respectively. To find the 
corresponding number of clusters, we repeat the calcu- 
lation in |24j . which is based on the fitting formula for 
the DM halo mass function from [32], and assumes clus- 
ters follow the Navarro- Frenk- White profile [33] . but we 
adjust the cosmological parameters (including the in- 
dependent value of as) to the values adopted here. We 
find the number of clusters is A^ c i us tcrs>K G = 48.8 ± 13, 
and iV c i us tcrs>KG — 31-7 ± 11 for w — — 1 and w = —0.8, 
respectively (the errors bars here are approximately twice 
the VN Poisson errors, enhanced by cosmic variance, fol- 
lowing [44J. Apparently, there are approximately twice 
as many convergence peaks in the maps without ellip- 
ticity noise as there are clusters, indicating a substantial 
number of projections. This is roughly consistent with 
the results of [T5] [TS] [T7] , which self-consistently identify 
halos in their simulations to quantify this correspondence 
more reliably, although those works find a somewhat 
closer correspondence between clusters and 4.5ct peaks. 
Adding ellipticity noise increases the number of peaks by 
another factor of ^two, resulting in plenty of convergence 
peaks that are not due to a single cluster, which can carry 
additional information about the cosmology. 

An important point to note here is that, as we empha- 



sized earlier, we keep the primordial amplitude fixed as 
we vary w, which leads to a different erg for each w. The 
difference in the average number of clusters (48.8 versus 
31.7) quoted above arises essentially entirely from this 
difference in a%. We expect that the difference in peaks 
counts will similarly be driven primarily by as- Tomog- 
raphy should break some of the degeneracy between as 
and w; this is an issue that will be clarified and explored 
in our follow-up work. 



B. Three Peak Types 

It is instructive to next take a more detailed look at 
how the number of convergence peaks depends on the 
peak height, and, in particular, how the counts in dif- 
ferent convergence bins vary with cosmology. In the top 
panel of Figured we show the difference in peak numbers 
(averaged over the 200 realizations) in 3 x 3 degree fields 
in convergence intervals [re, re + Are], with Are = 0.0161 
between the w — —0.8 (blue) and the ACDM models, as 
well as ACDM and w = — 1.2 (green). The middle panel 
in the same figure shows the standard deviation in each 
bin, and the bottom panel depicts the contribution to 
the x 2 an d thus to the distinction power, derived from 
each bin (note that this bottom panel shows the unequal 
re bins that were used in our \ 2 calculation). 

Based on the results shown in this figure, we identify 
three categories of peaks. 

• High Peaks (re >0.06): As is known from previous 
work [TS] [TBI [T7] , a significant fraction (and perhaps 
the majority) of these peaks are caused by individ- 
ual collapsed clusters, which can cause strong peaks 
by themselves. These peaks are more numerous in 
cosmologies with more negative w. This is as ex- 
pected: as mentioned above, the more negative w 
is, the later dark energy starts to dominate the en- 
ergy density, and the more time massive clusters 
had to form. 

• Medium Peaks (0.0 <re <0.06): The number of 
these peaks significantly exceeds the expected num- 
ber of clusters that would cause, by themselves, 
peaks with a similar height. We examined the con- 
tributions of individual lens planes along the line of 
sight (LOS) to the total convergence in these peaks. 
We found that for the majority of these peaks, mul- 
tiple planes contribute significantly, and we there- 
fore suspect these peaks are typically caused by 
projection along the LOS. In contrast with the high 
peaks, these medium peaks are more numerous in 
cosmologies with less negative w. This was unex- 
pected. Furthermore, we find that the difference 
in the counts of these medium peaks dominates the 
cosmological sensitivity. Clarifying the physical na- 
ture of these peaks is important, and will be pre- 
sented in a follow-up paper [451 . 
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FIG. 5: Top Panel: the difference in average peak numbers 
in each convergence bin is plotted as a function of conver- 
gence k. The ACDM cosmology is subtracted from w — —0.8 
(blue) and w = — 1.2 is subtracted from ACDM (green) in this 
plot. The height distribution of convergence peaks in ACDM 
is broader, while in w = —0.8 it is narrower, and more sharply 
peaked (and analogously for w — —1.2 vs. ACDM). The num- 
bers plotted are averaged over 200 realizations of a 3x3 degree 
field, with source galaxies at z 3 — 2, 1-arcminute smoothing 
and the peak heights divided into 50 equal-width convergence 
bins. Middle Panel: Same as the upper panel, but show- 
ing the standard deviation in the number of peaks each bin. 
Bottom Panel/ Contribution to the total {x} 2 from the peak 
number differences in each of the eight convergence bins used 
in our \ 2 calculation. Note that the bins have unequal width, 
chosen such that each bin contains approximately the same 
number of peaks (see text). Note that this panel neglects cor- 
relations between bins, in order to illustrate the n-dependence. 
There is a small contribution from low peaks, a strong one 
from medium peaks, and a sizable one from high peaks. 



• Low Peaks (k <0.0): Similarly to the high peaks, 
these peaks are more numerous in cosmologies with 
more negative w. However, their number difference 
compared to their variance is too low to make an 
appreciable contribution to the distinction between 
the cosmologies (the number difference of the low 
peaks is only a fifth of that of the medium peaks, 
yet their variance still about half). We speculate 
that these peaks typically reside in "convergence 
voids" (large areas of the map with very low values 
of convergence) . We have indeed verified that the 
maps in cosmologies with more negative w show 
a larger fraction in voids, which gives very low- 
amplitude peaks a larger area to reside in. 



C. Distinguishing w ^ -1 Models from ACDM 

We use the above difference in peak numbers in bins 
with different values of k to quantify the statistical dis- 
tinction between the three models with w = —1,-0.8, 
and —1.2. We find that the number of peak height bins 
used has a minor effect, as long as their number is kept 
between 4-9. Below this value, the binning is too coarse 
to capture all the information of the rich structure in the 
top panel of Figure [5] while higher numbers of bins show 
signs of over-binning, especially when several source red- 
shifts and smoothing scales are combined (see Appendix 

As our fiducial set-up, we use a single source plane 
at z s = 2 and 6q = 1 arcmin smoothing, as in the pre- 
vious section. Redshift tomography and using different 
smoothing scales are addressed in Sec. IV E below. The 



X for each of our 3x3 degree convergence fields (with 
ellipticity noise and smoothing), compared to the ref- 
erence cosmology ACDM, is calculated as described in 



X 



IIP This leads to the probability distribution of the 
shown in Figure [2] 

We can derive our statistical conclusions from these x 2 
distributions. For example, the mean of the x 2 distribu- 
tion for w — —0.8 lies at the 95% tail of the x 2 distribu- 
tion for w = — 1. This suggests that the 2a distinction 
from a single 3x3 degree field corresponds to Aw = 0.2. 
The corresponding result for the case of w — —1.2 is an 
exclusion at the 85% confidence level. 



D. Extrapolation to an All— Sky Survey 

While we are limited by computational costs to study- 
ing single 3x3 degree fields, the forthcoming surveys 
listed in § [I] will cover much larger areas of the sky - 
up to w 20,000 degree 2 . Therefore, first we would like 
to extrapolate our results to a larger field of view - or 
equivalently, translate it to the tighter 2a sensitivity to 
w expected from larger fields. 

We will use two methods of extrapolation, an opti- 
mistic and a pessimistic one. In the optimistic case, we 
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neglect large-scale correlations and assume that, with in- 
creasing survey size, the variance per area-squared in 
numbers of peaks just falls off as the number of peaks, 
i.e. a 2 N k oc N pea k- In the pessimistic case, we extrap- 
olate the scaling of the variance with survey size from 
the scaling behavior in subfields of our 3x3 degree weak 
lensing maps. We note that correlations on large scales 
generally fall off faster than the Poisson-like extrapola- 
tion would suggest [30J, and in reality, both cases may 
therefore be conservative estimates. 

We illustrate these two methods of extrapolations in 
Figure [6] where we plot the variance in peak numbers 
for progressively larger fields, starting with a 1.4 x 1.4 
arcminute field and going all the way to 3 x 3 degrees 
(always obtained from our 200 pseudo-independent real- 
izations) . 
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degeneracies with other parameters (see § IV F below). 
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FIG. 6: Variance (per area squared) in the total number of 
convergence peaks, as function of survey area. The red dots 
denote the variance as measured directly from our 3x3 degree 
convergence maps and subfields therein, for the w = — 1 cos- 
mology, with sources at redshift z 3 — 2 and 1 arcmin smooth- 
ing. The green line shows a power-law fit to the four largest 
subfields and an extrapolation to a 20,000 degree 2 field. The 
black line depicts the slightly more optimistic Poisson-like ex- 
trapolation (a% cak = N p eak)> which ignores long-range cor- 
relations. The difference between these two cases causes a 
factor of 1.3 difference in the extrapolated sensitivity to w on 
large scales. 

The optimistic extrapolation, shown by the black line 
in Figure § scales to a ^20, 000/2. 86 2 = 50 x stronger 
sensitivity for a 20,000 degree 2 field of view. The differ- 
ence in the dark energy equation of state parameters in 
our study is Aw = 0.2, which we discern at 2a. There- 
fore, roughly, a 20,000 degree 2 survey will be able to dis- 
cern a difference of Aw = 0.2/50 = 0.004 at the same 2a 
level. The somewhat more pessimistic case of extrapolat- 
ing from existing subfields, as depicted by the green line 
in Figure [6j yields a variance that is larger by a factor of 
1.3. Therefore, for this case, our estimated sensitivity w 
from an LSST size survey is Aw = 0.0052. 

While this is an interesting level of sensitivity, one 
has to keep in mind that this is a single-parameter con- 
straint, and is not directly comparable to marginalized 
constraints usually quoted in the literature which fold in 



E. Different Redshifts and Smoothing Scales 

In our fiducial set-up so far, we have used a single 
source plane (z s = 2) and smoothing scale (0c — 1 ar- 
cmin). Here we examine using other source planes and 
smoothing scales, including the possibility of using mul- 
tiple scales in combination. 



1. Single Redshifts and Smoothing Scale Combinations 

We have re-computed x 2 distributions, following the 
procedure described above, for a variety of different 
smoothing scales (0c = 1,2, 3, 4, 5, 10 arcmin) and source 
redshifts (z s = 1,1.5,2). We quote the results by indi- 
cating, in each case, the fraction of the realizations of the 
w = —0.8 model which can be excluded from the w = — 1 
cosmology at the 68% (95%) confidence level. These frac- 
tions are to be compared to the values of 32% and 5%, 
expected in the absence of any signal (i.e., for the same 
cosmology, compared to itself). These fractions are pre- 
sented in Table 1. We list dc = 0.5' in the table as well, 
but we caution that, as discussed above, our simulations 
do not have sufficient resolution to reproduce the con- 
vergence power spectrum on these scales, and therefore 
these results may not be reliable. Unless noted other- 
wise, we exclude these values from the discussion in the 
rest of this paper. 



% exclusion chance 








@ 68 (95)% CL 


Zg = 1 


z s = 1.5 


z s = 2 


G = 0.5' 


48 (13) % 


76 (32) % 


90 (46) % 


b g = v 


56 (16) % 


68 (18) % 


84 (34) % 


0g = ^' 


53 (12) % 


66 (13) % 


50 (18) % 


6 g =3' 


41 (10) % 


48 (14) % 


40 (6) % 


g = 4> 


39 (6) % 


32 (7) % 


37 (12) % 


G = 5' 


38 (6) % 


28 (8) % 


39 (10) % 


9c = 10' 


26 (2) % 


28 (5) % 


25 (4) % 



Table 1: The probability of distinguishing the w = —0.8 
model from KCDM, based on peak counts in a single 3x3 
degree field, at the 68% (95%) confidence level for various 
source redshifts z s and smoothing scales 0g, using 8 peak 
height bins. 

There are several trends worth noting in Table 1. First, 
using source galaxies at high redshift (z s = 2) pro- 
vides substantial advantage over using sources at redshift 
z s = 1.5 or even z s — 1 (recall that the surface density is 
fixed in each case to be 15 arcmin -2 ). Dark energy con- 
straints from Type la supernovae, and from most other 
techniques also benefit from deep surveys and high red- 
shifts. For weak lensing, going to higher redshifts is par- 
ticularly important, as the best lensing efficiency comes 
from approximately halfway in distance between sources 
and observer. 
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Second, as is well known, weak lensing derives most of 
its power from small scales, and indeed, the smaller the 
smoothing scale in our maps, the stronger the distinction. 
This is true down to the smallest angular scales that we 
can reliable simulate, 6 G k I. Smoothing scales larger 
than 9q = 2' do not provide very strong constraints by 
themselves, but may still be useful when combined with 
smaller ones (especially for real surveys with much larger 
fields of view - the number of peaks at these large scales 
becomes very small for our small 3x3 degree fields). 



2. Correlations 

There are several types of correlations that would be 
useful to investigate: i.e., correlations in the number 
of peaks in different height bins, identified for different 
source redshifts, and with different smoothing scales, as 
well as the cross-correlations between these quantities. In 
order to study the effect of these correlations, we com- 
puted the x 2 distributions from covariance matrices in 
which only diagonal elements, or diagonal blocks were 
retained, and the cross-terms outside these were set to 
zero. 

We find that the peak height bins are correlated. Ne- 
glecting all correlations, by retaining only the diagonal 
elements of the covariance matrix, lead to \ 2 histograms 
that resemble % 2 distributions with fewer degrees of free- 
dom, and an excessive amount of outliers, both typical 
signs that the dataset contains more correlations than 
is assumed in the analysis. In particular, as expected, 
the y 2 distributions (analogous to those shown in Fig- 
ure p2| develop wider tails toward high x 2 values, with 
their maximum moving to somewhat smaller values. The 
net result is that the overlap of the x 2 distributions be- 
tween a pair of cosmologies is enhanced, and the dis- 
tinction between the cosmologies is degraded. Table 2 
illustrates this in numbers. 



the number of elements in the covariance matrix grows 
quickly. Because of the low number of realizations and 
the corresponding underestimation of the variance, re- 
sults obtained by using more than ss 9 bins become un- 
reliable (Appendix VI I . Unfortunately, this prohibits us 
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z s = 1 


z s = 1.5 


z s =2 


e G = 0.5' 


32 (1) % 


46 (2) % 


52 (2) % 


e G = v 


34 (2) % 


54 (2) % 


65 (8) % 


G = 2' 


40 (3) % 


51 (5) % 


44 (8) % 


G = 3' 


39 (2) % 


44 (6) % 


38 (4) % 


e G = 4' 


36 (2) % 


30 (6) % 


32 (5) % 


G = 5' 


32 (4) % 


32 (3) % 


36 (6) % 


9 G = 10' 


24 (2) % 


32 (4) % 


26 (3) % 



Table 2: Same as Table 1, but with correlations be- 
tween peak height bins neglected. The likelihood of dis- 
tinguishing w = —0.8 from KCDM are much lower, and 
approach the no- significance values of 32 (5) %, in par- 
ticular for the 95% confidence level numbers, indicating 
a large number of outliers. 

We also attempted to study cross-correlations between 
different redshifts and smoothing scales. When study- 
ing several redshifts and smoothing scales simultaneously, 



from studying the effect of correlations between redshifts 
and smoothing scales reliably: the number of different 
peak height bins still allowed becomes too small. In 
particular, when multiple source redshifts or smoothing 
scales are simultaneously included, only 2-3 peak heights 
are allowed by the low number of realizations. We found 
that such a low number of convergence bins does not suf- 
ficiently capture the peak number difference distributions 
in Figure [5] and no meaningful constraints are possible. 
We defer a study of these cross-correlations to a follow - 
up paper, with a larger number of independent realiza- 
tions of each cosmology. 



F. Comparison to Other Forecasts for LSST 

It is useful to place our results in the context of con- 
straints expected to be available from other uses of large 
WL maps. Three methods we will briefly mention here 
are (i) the shear power spectrum, (ii) cluster number 
counts, and (iii) the one-point function of the conver- 
gence PDF. 

(i) From an 11-parameter fit to the tomographic shear 
power spectrum from LSST and constraints from Planck, 
[33] obtain a(w ) = 0.06,cr(w a ) = 0.09. 

(ii) From a 7-parameter fit to the number counts of 
200,000 shear-selected clusters, combined with Planck 

constraints, [21] obtain a(wo) = 0.04, a(w a ) = 0.09. 

(iii) Using the fractional area statistic of areas of high 
convergence, [3D] found a(w ) — 0.043, a(w a ) = 0.11 for 
LSST and priors from Planck. 

The above are all la constraints, marginalized over a 
large number of parameters. We do not attempt a fair 
comparison of the three methods, which would require 
that identical assumptions are made about the survey 
specifications and about the set of free parameters (see 
[lj for a similar exercise covering i. and ii.). Neverthe- 
less, these numbers demonstrate, first, that the statistical 
information contained in the non-linear features in WL 
maps - captured by (ii) and (iii) - is at least roughly com- 
parable to the information encoded in the power spec- 
trum in (i). Second, our single-parameter sensitivity 
to w can be compared meaningfully to those from clus- 
ters counts. In particular, as mentioned in § |IV A| the 
counts of clusters above the 4.5k g convergence thresh- 
old differs in the w = —0.8 and w = — 1 cosmologies by 
« 1.5cr. We find that the number of convergence peaks 
above the same threshold differs in these two cosmolo- 
gies at a similar, ss la significance level. We find further 
that considering lower height peaks, with convergence 
well below this threshold, significantly improves the dis- 
tinction. While follow-up work, with simulations that 
address degeneracies between w, as, and other parame- 
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ters is needed, these results suggest that the parameter 
sensitivity from WL peak counts will be competitive with 
those from the above three methods. 

As mentioned in § I] in a recent preprint, 5T] inde- 
pendently addressed the dependence of WL peaks on the 
background cosmology. |31j used the same simulation 
box size as in our work, but a smaller number of par- 
ticles (256 3 , rather than 512 3 ), which allowed them to 
sample a 2D parameter space and initial conditions, at 
the cost of mass and angular resolution. They compute 
the aperture mass for the peaks as a weighted integral of 
the tangential shear, rather than the convergence. De- 
spite these differences, the conclusions of the two papers 
appear to be remarkably similar. In particular, if we at- 
tribute the distinction we find between our cosmologies 
with different w values entirely due to the differences in 
erg in these cosmologies, we find a very similar sensitivity 
to the latter parameter as their Figure 3: 0.73 <<7s <0.83 
(for a fixed f2 m = 0.26). We find, additionally, a sub- 
dominant type of peaks - low-amplitude peaks, residing 
in the voids - the distinction between the cosmologies 
is however dominated by medium peaks, with a strong 
contribution from high peaks. 



guished from ACDM with a single 3x3 degree conver- 
gence field at a 95% and 85% confidence level, respec- 
tively. Extrapolating these results to the 20,000 degree 2 
field of view of a large survey, such as LSST, we obtain a 
projected sensitivity to w of Aw = 0.004-0.0052, depend- 
ing whether the extrapolation is based on the variance 
measured in subfields of our maps, or whether a simple 
Poisson-like scaling is used. 

In this work, we have explored three constant values 
of the dark energy equation of state, to = —0.8, w = —1, 
and to = —1.2, representing variations around the best fit 
ACDM model to the 5-year WMAP data. The most sig- 
nificant concern is that there will be strong degeneracies 
between w and other parameters; our results may already 
be driven primarily by the as sensitivity in the number of 
peaks. In a followup paper, we will study such degenera- 
cies; it will also be necessary to perform a larger number 
of realizations of each cosmology, in order to study the 
utility of combining several source galaxy redshifts and 
to use information possibly contained in the angular size 
distribution of the peaks. In a separate publication |45j . 
we also plan to investigate in more detail the physical 
origin of the three kinds of peaks detected in this work. 



V. SUMMARY 

We propose the new method of using the counts of 
peaks, identified in weak gravitational lensing maps, to 
constrain dark energy and other cosmological parame- 
ters. The method makes no reference to cluster counts 
or their mass function, and so, by definition, is free from 
selection effects inherent in methods that target clusters. 

We used N-body simulations to create theoretical con- 
vergence maps in three cosmologies with w = —0.8, 
to = —1, and w = —1.2. We identify three different peak 
types: high, medium, and low peaks, whose numbers de- 
pend differently on cosmology. The high (k >0.6) and 
low (k <0) peaks both become more numerous with in- 
creasingly negative to, while the opposite trend is true for 
the medium peaks. For the high peaks, this dependence 
confirms the expectation that when w is more negative, 
dark energy begins to dominate later, and more time is 
allowed for the growth of non-linear structures. The op- 
posite behavior of the medium peaks is a surprise - which 
is all the more important since we find that these peaks 
dominate the distinction between the cosmologies. The 
majority of these peaks are not typically caused by in- 
dividual collapsed clusters, and are more likely due to 
projection of large scale structure along the line of sight. 
The low peaks form a subdominant contribution. These 
peaks are typically due to random ellipticity noise, and 
we speculate that the larger surface area of voids in cos- 
mologies with more negative to, which we have detected 
in the maps, offers more opportunities for these small 
fluctuations to arise. 

We have shown that based on these peak counts, cos- 
mologies with to = —0.8 and w = —1.2 can be distin- 
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VI. APPENDIX: INITIAL CONDITIONS AND 
THE NUMBER OF REALIZATIONS 

In this Appendix, we show our results as a function 
of the number of peak height bins used, and justify the 
choice for the number of bins we made in the body of the 
paper. 
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Ideally - in the limit that the covariance matrix was 
perfectly computed - the distinguishing power would 
asymptotically approach a constant value as one in- 
creases the number of bins, at which point using more 
bins will not provide any more benefit. Unfortunately, it 
is computationally expensive to generate a large ensem- 
ble of initial conditions from which to construct several 
strictly independent realizations of a given cosmology (a 
single realization takes approximately 4800 CPU hours, 
i.e. ca. 2.5 days on 80 CPUs). This, together with the 
storage required for the large amount of data produced 
in each run, ultimately limits the number of bins we can 
utilize. 

In particular, using too few realizations, together with 
too many bins, will generally result in an underestimate 
of the variance, and an artificial boost in the \ 2 values. 
The possible concern, then, is that the differences we find 
in the convergence maps between two cosmologies may 
be caused by such an underestimation of the variance. 

In order to guard against this possibility, and to verify 
that the differences in the convergence maps are caused 
by genuine differences in cosmology, we generated a sec- 
ond set of independent initial conditions for the cosmolo- 
gies with w = — 1 and with w — —0.8. The simple Ansatz 
we adopt for the maximum number of bins that can be 
safely used, without underestimating variances, is the 
following: when two different realizations of the same 
ACDM cosmology are compared, we should recover the 
X 2 distribution, i.e., we must not be able to rule out the 



fiducial model itself. 

In Figure [7J we show the percentage of the realizations 
of the w = —0.8 test cosmology that lies beyond the 68% 
(left panel) and 95% (right panel) tail of the \ 2 distri- 
bution (solid curves), together with the same quantities 
for the fiducial ACDM cosmology itself (dashed curves). 
Our Ansatz requires that the dashed curves correspond 
to the confidence level at which the exclusion is to be 
achieved (e.g. no more than 32 / 5% of maps excluded 
at the 68 / 95% confidence level). We see that this con- 
dition is satisfied when 4-9 bins are used, but for 10 or 
more bins, there is a steadily rising difference between 
the two realizations of the ACDM model. We therefore 
take 9 bins as the maximum acceptable total number of 
bins. 

Figure [7] also shows that the distinction between w = 
—0.8 and the ACDM model is relatively steady for 4-9 
bins, and is insensitive to the choice of the realization 
of the initial condition. However, for a larger number 
of bins, the distinction starts to be either less significant 
(when different realizations of the initial conditions are 
used in the two cosmologies; solid curves), or begin a 
modest rise in significance (when the same realizations 
are used; dotted curves). 

While the above analysis is based only on the number 
of convergence bins, we find similar conclusions when the 
bins include multiple source galaxy redshifts and / or mul- 
tiple smoothing scales. We conclude that the accuracy of 
our covariance matrix limits us to a total of 9 bins. 
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FIG. 7: The solid curves show the percentage of maps in the w = —0.8 cosmology which can be excluded at the 68% (left panel) 
and 95% (right panel) confidence level from ACDM. In these comparisons, the maps in the w = —0.8 cosmology were generated 
from initial condition A, which were compared to the expected average in the ACDM cosmology with initial condition B; the 
covariance matrix was calculated from ACDM with initial condition A. The dotted curves show the same quantities, except that 
the maps in the w = —0.8 model were taken from initial condition B, and those from ACDM with initial conditions A. The 
dashed curves again show the same quantities, except maps from the ACDM model (with initial condition B) were compared 
against ACDM with initial condition A and covariance matrix calculated from initial condition A. If the covariance matrix was 
correctly estimated, the latter curve should be fiat, whereas it is begin to rise systematically for 10 or more bins. Based on this 
result, we identify 9 as the maximum number of bins that can be safely used without underestimating variances. 
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